lmSubsets: Exact Variable-Subset Selection in Linear Regression for R

نویسندگان
چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Group subset selection for linear regression

Two fast group subset selection (GSS) algorithms for the linear regression model are proposed in this paper. GSS finds the best combinations of groups up to a specified size minimising the residual sum of squares. This imposes an l0 constraint on the regression coefficients in a group context. It is a combinatorial optimisation problem with NP complexity. To make the exhaustive search very effi...

متن کامل

An Exact Implicit Enumeration Algorithm for Variable Selection in Multiple Linear Regression Models Using Information Criteria

For large multivariate data sets the data analyst often wants to know the best set of independent regressors to use in a multiple linear regression model. Akaike’s Information Criteria (AIC) is one information criterion calculated in SAS that is used to score a model. For a small number of independent variables p, an explicit enumeration of all possible 2 models is possible. However, for large ...

متن کامل

FWDselect: An R Package for Variable Selection in Regression Models

In multiple regression models, when there are a large number (p) of explanatory variables which may or may not be relevant for predicting the response, it is useful to be able to reduce the model. To this end, it is necessary to determine the best subset of q (q ≤ p) predictors which will establish the model with the best prediction capacity. FWDselect package introduces a new forward stepwiseb...

متن کامل

Variable selection in linear regression through adaptive penalty selection

Model selection procedures often use a fixed penalty, such as Mallows’ Cp, to avoid choosing a model which fits a particular data set extremely well. These procedures are often devised to give an unbiased risk estimate when a particular chosen model is used to predict future responses. As a correction for not including the variability induced in model selection, generalized degrees of freedom i...

متن کامل

Alternative Strategies for Variable Selection in Linear Regression Models

1. INTRODUCTION 1.1.1. Variable Selection for Incomplete Data sets In statistical practice, many real-life data sets are incomplete for reasons like non-responses or drop-outs. When a data set is incomplete, practitioners frequently resort to a " case-deletion " strategy within which the incomplete cases are excluded from analysis and the complete cases are formed into a reduced rectangular com...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Statistical Software

سال: 2020

ISSN: 1548-7660

DOI: 10.18637/jss.v093.i03